BlogTrackers: A Tool for Sociologists to Track and Analyze Blogosphere

Authors

  • Nitin Agarwal
  • Shamanth Kumar
  • Huan Liu
  • Mark Woodward
Abstract

We present BlogTrackers, a tool that helps sociologists track and analyze blogs of particular interest by integrating a set of unique features. We give an overview of BlogTrackers, illustrate the functions of its components, and outline future work to meet the growing needs of sociologists.

Introduction

The blogosphere, the network of blogs, is growing at a phenomenal rate. Technorati has indexed around 133 million blog records (http://technorati.com/blogging/state-of-the-blogosphere/). Sociologists are interested in studying the blogosphere to track socio-behavioral patterns, identify influential people in a region of interest, and follow interesting activities. They often have to eyeball the sites for useful information. Given the gamut of interests in the blogosphere, this can be a tedious and time-consuming task. With this user-oriented application, we aim to alleviate the problem by helping them track and analyze the blogosphere effectively. BlogTrackers gives sociologists the freedom to choose the blog sites they wish to analyze and to observe interesting events and patterns, with the flexibility of drilling in. The tool consists of a number of analysis and crawling modules. It is a convenient alternative to eyeballing blog sites and lets users concentrate their efforts on further analysis.

Most tools are generic in nature and cannot be directly used by sociologists and others with specific needs. BlogTrackers is designed specifically for those needs: it both collects data and provides convenient visualization tools for analyzing it. Table 1 presents a comparison of BlogTrackers with some of the existing tools in the domain. Although sites like Technorati and BlogPulse provide features similar to our tool, they cannot be directly used. BlogTrackers combines these features in a unique manner to maximize the analytical capability of the individual techniques.

Apart from these tools, there is also generic visualization software that does not target blogs per se but can be used to analyze blog data. Pajek is a visualization tool that can be used to visualize network data in various ways. IBM's ManyEyes is another interesting project on generic visualizations but suffers from scalability issues. The Prefuse visualization toolkit contains a set of unique visualizations for the data.

BlogTrackers

BlogTrackers is a Java-based desktop application that provides a unified platform for the user to crawl and analyze blog data. It gives the user the freedom to choose the data of interest and helps in analyzing it effectively. The data is stored in a relational database. Currently we are tracking 10 different data sources, such as Twitter, Engadget, The Unofficial Apple Weblog (TUAW), and LiveJournal.

Copyright © 2009, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Related resources

Analyzing Shift in Narratives Regarding Migrants in Europe via Blogosphere

Social media is widely used by individuals to express their views or opinions with others. Social media users leverage this platform to further their views by framing narratives and participating in online discourse. Nowadays almost all events, issues, and crises are discussed on social media. Blogs are not regulated by any authority and have no limit on the number of characters unlike other so...

Convergence of Influential Bloggers for Topic Discovery in the Blogosphere

In this paper, we propose a novel approach to automatically detect “hot” or important topics of discussion in the blogosphere. The proposed approach is based on analyzing the activity of influential bloggers to determine specific points in time when there is a convergence amongst the influential bloggers in terms of their topic of discussion. The tool BlogTrackers is used to identify influenti...

Overview of the TREC 2009 Blog Track

The Blog track explores the information seeking behaviour in the blogosphere. Thus far, since its inception in 2006 [9], the Blog track addressed two main search tasks based on the analysis of a commercial blog search engine: the opinion-finding task (i.e. “What do people think about X?”) and the blog distillation task (i.e. “Find me a blog with a principal, recurring interest in X.”). In TREC ...

On the TREC Blog Track

The rise of blogging as a new grassroots publishing medium and the many interesting peculiarities that characterise blogs compared to other genres of documents opened up several new interesting research areas in the information retrieval field. The Blog track was introduced in 2006 as part of the renowned Text REtrieval Conference (TREC) evaluation forum, to drive research on the blogosphere an...

Overview of the TREC-2010 Blog Track

• Top stories identification: A task that addresses news-related issues on the blogosphere, namely investigating whether the blogosphere can be leveraged to identify the top news stories of a given day in a real-time fashion. The task has also a search diversity flavour, where for a given story, a representative set of blog posts discussing the story from various perspectives [7] is shown to th...

Publication date: 2009